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ABSTRACT 


This thesis outlines a new way of presenting the theory of canonic 
systems, including a distinction (for methodic reasons) between simple 
canonic systems and general canonic systems, and proves a series of re- 
sults on hierarchies of canonic systems. After a brief summary of Doyle's 
results on a partial hierarchy of canonic systems, a new hiererchy is 
developed (Chapter II) which relates the general canonic systems not only 
to all 4 types of formal grammars defined by Chomsky but also to eny class 
of formal grammars definable in terms of productions. It is also shown 
(Chapter III) that all attempts to define a mathematical system which ex- 
actly corresponds to the recursive sets are necessarily fruitless. 
Doyle's work on how to define "noncontracting canonic systems with predi- 
cates of degree 2" (NCST) is continued, arriving at a workable definition 
which permits us to prove [NCST] = [Type 1] (Chepter IV), a conjecture 
put forth at the IIId Princeton Conference on Information Sciences and 
Systems. This results transforms Doyle's hierarchy from "the union of 
two half-hierarchies and a dangling term (the NCST)" into a complete hi- 
erarchy of canonic systems (all 4 types represented). However, this hi- 
erarchy is heterogenous: canonic systems corresponding to grammars of 
types 3 and 2 use only predicates of degree 1, while canonic systems cor- 
responding to grammars of types 1 and 0 use also predicates of degree 2; 
moreover, for all types of grammars except for context-sensitive grammars 
the canonic systems turned out to be simple. Schematically, the fom 
of this hierarchy may be summarized as . 


S1 S1 G2 §2 (for types 3, 2, 1,0) . 
We first show (Chapter V) how to get a hierarchy of simple canonic system, 
Sl sl S2 S82 ’ 


‘using as base Doyle's hierarchy, and then transform it. into 

Sl S1 S2 Sl 
Since this hierarchy does not seem to lend itself to further homogeni- 
zation", we shell use the hierarchy of Chepter II to obtain « hiererchy 
of simple canonic systems with predicetes of degree 1: 

$1 S1 Sl Sl 
Several new classes of canonic systems (non-crossreferencing, non-insert- 
ing, and pure canonic systems) are introduced in Chapter VI, where their 
properties are explored, and a classification schema and several hierar- 
chies are developed. 
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CHAPTER I 
SIMPLE AND GENERAL CANONIC SYSTEMS 


This chapter presents the differences between the traditional defi- 
nitions and the ones we will use, and builds the theory of canonic ays- 
tems according to the new specifications. It also includes the motiva- 
tion for the reorganization of 'canonic systems, | . 

. Canonic systems were first defined in Donovan 1966 . The seaveing 
point of our work was the version presented in “Donovan and Doyle 1968, 
pp- 3-9. The reader is assumed to be acquainted with this work, and | 
therefore we will not repeat that: definition but rather preenar the 


present Gaiy the modified definition. 


A canon used to be defined as a List of ss gements followed ‘by the 
a : 
sign k and then followed by a statement, where a statement |(tradition~ 


ally called 'remark') is composed of otem of some » degree fol lowed by 
a predicate of the same degree. A term of degree n is an ne tupte of 
arbitrary concatenations of Gettaties ead worden the Avan alphabet, 
the words surréunding the variables being retested to as the context 
of the variables. A Gerticular case was singled out, ‘the cask: Waa | 


context is actually indicated, jad the canonit systems satiefying this 


- condition, i.e. canonic systems which contain at least one canon in 


which there is an instance of vartibiss and symbols Scbcateaated together 
in the same term, were called canoni¢ systems with indicated. context | 


(CSwIC). [Donovan end Doyle 1968, p. 28; ‘Raggerty 1969, p..41]}, but not 


Sey ea er kar ce aE ES 


much was known pbout them beyond the observation that they appear to be 
rather powerful. Most classes of canonic systems encountered in the 
course of research were not "canonic systems with indicated context" in 
the sense of the old definition mentioned above; moreover, in all cases 
but one, constructive proofs for the existence of canonic systems with a 
certain property yielded canonic systems which were not "with indicated 
context", and the same holds for Alsop's “canonic translator" [Alsop 
1967] . Because of these, and especially in view of Haggerty's recent 
‘result [Haggerty 1969] that contextual indications can be dispensed with, 
we have decided not to regard as a distinguished class the class of cano- 
nic systems which do exercise the option of indicating Sontaxt, but rather 
to distinguish the class of canonic systems which have no such option 
available, and call them simple canonic systems s while the unrestricted 


ae Rawnweae 


general _canonic systems or as canonic_systeus with indicated context ~. 
[Therefore the new meaning of this term is that we have the option of in- 
dicating contextual conditions, and nothing more than the option, in con- 


trast with the old meaning, which required us to exercise this option in 


at least one canon.] 

The situation is intinc to that encountered in automata theory, in 
connection with the definition of Sondaterainistic-autece ta. The old de- 
finition of canonic systems with indicated context corresponds to the 
machine! [NIM] : "A NIM is a TM in which there is at least one state sa- 
tisfying the condition that for at least one symbol of the tape alphabet 
there are two or more quadruples {or quintuples, if we work with quin- 


tuples] in the specification of the TM" . According to this definition, 


deterministic TM were not particular cases of NIM but constituted a class 


“w ~ ee a S n>, Sak’ "jee enemies aa’ >. ee | st 2 =i -¢ SH 3 a 2 a.@m=e ee, 
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OLD NEW 


CANONIC SYSTEMS CANONIC SYSTEMS = 
= GENERAL CANONIC SYSTEMS = 
= C.S. WITH INDICATED CONTEXT 


Figure 1 


Graphic representation of the changes in terminology 
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The circles represent particular examples of canonic systems. 


would not be fortunate, and, in fact, this is not the definition of nondeter- 
ministic Turing machines; as everybody bie: vi then ane deterministic TMs 
sexs singled rane ee distinguished) as o partioujar, cose of NIMs. The new 
definition of canonic systems git cead eared canta ane ene introduction of 


the simple canonic systems were necessitated in order to "normalize" the 


usage in canonic systems, to switch from @ nomenclature corresponding to 


the hypothetical definition.of NIM, in que-exanple, to a nomenclatuce 
which corresponds to the true definition of WIM. | . 

Similarly, instead of talking of canonic systems: wah insertion, ger 
of canonic systems with crosampferencing, etc., we would single. out the 
eanonic systems without insertion, or wgthout crossreferencing, ates 
These classes of canonic sjated will be int rogaced and. studied in 
Chapter VI. a. 7 

The way in which we chose to "implement" this reorgepication is by. 
introducing Patera. ("premise ternal) and. thetr- see avo with . 


tems and their liste, ‘and > 


p-term ‘ie en n-tuple each of whose elements ie 4 "pare" concatens- : 
tion (containing either exclusively variables or exélustvely syabols) 


, This, incidentally, also eliminates the recursion on tem » 80 that 


so metee PE SRI S 


it will no longer the case that a substring « ofa | tern bad ‘automaticnlly, 


itself a tern. 


We are now ready to present the definition of simple canonic systems. 


Definition 1. A simple canonic system (of level i ) is a septuple 


ee re Pan ee Pe ee ne 
¢ CO MM Pe Ms GD 


where 
Cc is a finite set of ‘ae ee of snEGrence); 


is the alphabet used to form the strings generated by bs 


ae 


M is a finite set of yari Le “used to » stand for elements 


used to usme sets of tuples. 
the tuples is the degree . 


P is a finite set of p 
The number of 


See chigee ym RE 


figs Sa ARE ME 
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S; is a finite set of punctuation signs; 

D; (GePy) is a set of sentence predicates whose union will 

be defined to be the language specified by the 
canonic system. 

Gn is the "object" canonic system... 

This definition is not complete until we say what the canons, 
variables, predicates are and what we can do with them. 

However, since the reader is assumed to be familiar with these 
concepts, these will not be repeated here. Most of the differences 
have been outlined above, and a formal definition, using second- 
level canonic systems, will. now be given. The reader is urged to 
compare it with the old definition of canonic systems [Donovan 4nd 
Doyle 1968, pp. 10-18], to get a complete and accurate image of the changes 
that were introduced. In order to facilitate the comparison, our 
exposition will also be given by way of an example, and will use the 
same example, a canonic system defining the set of numbers composed 
of the digits 1,2 and 3. “Moreover, the drawing on page 5 of the 
above-mentioned work is presented below in a updated form as Figure 2 
to provide a quasi-pictorial representation of some of the changes 


introduced. General canonic systems are defined similarly, but 


—_—eee ee ee 


the conclusion but also in the premises. 


) 
1 0 
where C, [- 1 aigit 
L- 2 digit 
L 5 digit 
x digit L- X number 
x digit ; y number L- yx number 
WS Se Se 5) 
1 
Me Sd oe ge OF 
1 
P = { digit, number } 
1 
Sy. {3,5} 
Dd = { number } 
bo = ( &, x, 8, 8, &, B, & ) 


The following parse of the fifth canon of this system illustrates 


the metalanguage used to describe canons. 


BEIT BERG Utes LS i 8 RE apoE Ra i A as meas Sok AAA 92 ne SF RSE Ds PS Ca RET AMER DT ETT RE ERA ST 


digit 


conc. of var. 
‘Seca eeeeans 


\ premises 


of legai canon format 


loseieemneithdnenss oe ees en ciandndiiabiisacemeeeanemesianiaes aes cmmedieeameaineen” 


Figure 2 
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The second-level canonic system is a 7-tuple 


G, = Cy. Vas Mgr Poe Sp, 6, D 


where 
CF { the canons listed on the following pages } 
iP ae of. 232 Ss digit, number, x, y, ; , }- } 
My = {q, r, s, t, u,v, w } 
R = { predicates as defined in the cations } 


s, ef 3s ,.<.>, eo 


Dd, = { legally defined string } 
—————— ee 
G, is the first-level canonic system 


The canons of the second level must formally define the 
metalanguage and operations. of the first level; these canons are 
presented on the following pages with a brief discussion of the 
motivation and use of some of the canons. The particular manner 
in which we have constructed the second-level canons system 
allows this system to define other canonic systems with only 
slight modifications, which include, mainly, canons which 


define the set of canons of the system being defined. 


(1.1) bm Symbol 
(1.2) fmm 2 symbol 
(1.3) b=s symbol 


(2.1) bm: sign 


(2. 
(3. 
(3. 
(4. 
(4. 
(4. 
(5. 
(5. 
(6. 
(6. 
(6. 
(6. 
(6. 
(6. 


(6. 


(6. 


(6. 
(7. 
(7. 
(8. 
(8. 


(8. 


no OW wa NO eM HNO Be WM DH mm NN i) 
ee 


™“ 
~ 


8) 


9) 
1) 
2) 
1) 
2) 


35): 
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X variable 


y variable 


py 


digit predicate 


number: sepsenge predicate 


word ( A is the null string) 


u symbol ;; v word fee uv word 


u yariable peu concat, of var. 

u  concat. of var. 3; Vv vari able ad uv concat, of var. 

u concat. of yar. b= u p-term | 

u word feu poterm 

u variable feu concat, of var, § words 

u word pu concat. of var, § words 

u concat. of v words ;; v variable }euv concat. 
of var, §& words 

u  concat. of v 33 Vv Word fe uv concat. 
of var, §& words 

u jr u term 

t poterm 3; u predicate l= tu premise 

t term ;;u predicate fe tu statement 

A list of premises 

u 


list of premises ;; v premise bruv; list of prem. 


A list of statements 


14 


(8.4) u list of statements ;; v statement euv; list of 
statements o 


For efficiency's sake, one might add 


(7.0) u premise f= u statement 
(8.0) u list of premises fxu list of statements 


Note especially the intuitive meaning of p-term : a p-term is 
either a concatenation of variables or a single word (in V*). A 


term is an arbitrary concatenation of words and variables. The 
difference between premise and statement is that premise does not 

allow concatenations of variables and symbols (hence it ia "context-free'') 
while statement allows them. One and the same variable may occur 


several times in the hypothesis and the conclusion of a canon. 


(9.1) u word feu constant 

(9.2) u predicate feu constant 

(9.3) u sign f= u constant 

(9.4) u gonstant >; v constant foes uv constant 
(10.1) js <x Cy > differ 

(10.2) <u ev > differ FE <v < u> differ 


The following canons define a set or ordered quadruples named 
substitution. They specify the substitution of constants for 
variables in canons. Thus each canon of the first-level canonic 


system, if it contains any variables at all, gives rise to a class of 


15 


specific instances of ania These instances are obtained when 

any terminal string is substituted for the variables in the canon. 
Substitution is defined by a 4-tuple <weveset > 

The first element, w , is a word; the second element, v , isa 

variable; the third element is the original nonempty string s ; and the 

fourth element is the string t which results when the word is 


substituted for each occurrence of the variable in the original 


string. 
(11.1) w word ;; v variable b<wevevew> substitution 
(11.2) w word ;; s variable ;; v variable 3; <v.s > 
differ b< Weve S < S > substitution 
(11.3) w word ;; v variable ;; s constant E <w <VeSe5S? 
substitution 
(11.4) <WeVeS <q substitution 5; <wevexe<t t= 


<Wev< SX < qt > substitution 


Canon 11.1 defines the substitution of a word for a variable 
in a string consisting of only that variable. Canon 11.2 defines 
the substitution of a word for a variable in a string which does 
not include that variable; this substitution has no effect. Canon 
11.3 defines the substitution of a word for a variable in a constant 
string; this substitution has no effect. Canon 11.4 defines substi- 
tution in general. 

Canons 12.1 - 12.5 list the canons of the first-level canonic 


system. 


16. 


(12.1) - 1 digit canon 

(12.2) ke 2 digit canon 

(12.3) FE 3 digit canon 

(12.4) - x digit x number canon 

(12.5) Pe x digit ; y number | yx number ganon 


In order to make sure that indeed the canons are of the 
required format, we add: 
(13.0) Vv statement b= -v of legal canon format 
(13.1) us list of premises ;; v statement feu bv of 
legal canon format | | 
(13.2) u canon 3; u of legal canon format fx u instance 
of legal canon 


_ (Canon 13.3 defines the set of canons in which constants have 


been substituted for some or all of the variables.) 


(13.3) u instance of legal canon ;; v variable ;; 
wwod 3 <Wweveu, t> substitution b= 


t instance of legal canon 


Canon 13.4 defines a subset of the canons; this subset is the 
set of all canons which contain only constants. Derivations will be 


generated from "canons with constants." 


(13.4) - U instance of legal canen 3; u constant = 
u instance with constants 
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Canons 14, 15.1 and 15.2 define the sets named constituent of 
and occurrence ; these sets are used in defining derivation. It 
has been stated that a statement can be derived as the conclusion 
of a canon by showing that all of the statements in the premise 
have been derived; i.e., the premise occurs in the derivation. 
Thus, the meaning of the "occurrence" of a statement in a list of 
statements must be defined. The concept "occurrence" must be 
generalized to show that all of the statements in the premise have 
already occurred in the derivation; this generalization is the 


set constituent of. 


(14 ) v statement ;; r list of statements ;; t 


list of statements Je<v rv;t> occurrence 


(15.1) u list of statementska < u> constituent of 
(15.2) u list of statements ;; v list of statements ;; 


<u <v> constituent of $< w <v> occurrence = 


<uWe Vv >constituent of 


(16.1) A derivation 
(16.2) t derivation ;; w list of statements 5; 


u statement 
33 Wf-u instance with constants ;;<w <t > 


———————————————— 


constituent of f= tu; derivation * 


The canon (16.2), which also occurs in the definition of general 
canonic systems, is not itself admissible in a simple canonic system. 
In other words, the higher-level canonic systems that we construct 
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The final set to be defined is the set of strings derived 
by derivations; each of these strings is simply the last statement 


in some derivation. 


(17) tu; derjvation ;; u statement hose 
u legally dezived string. 


Canons 16 and 17 are of particular interest since they define 
the essence of a proof (derivation) and a law (legally derived 
string) in all mathematical systems. 

This completes the construction of the canons of the second-level 
canonic system. In this example the first-level canonic system had 
only predicates and terms of degree 1; modification to the second- 
level system may be made to handle predicates and terms of higher 
order in the first-level canonic system [Donovan and Doyle 1968] . 

The metalanguage describing the second-level canonic system 
(canon, substitution, derivation, ates) has not been defined; a 
third level system would be needed to define it formally. The form 
of the third-level canonic system is almost identical to that of 
the second-level system with appropriate changes in notation, i.e. 
predicates are underlined three times and the punctuation signs 
are tess! and + poe. We now outline briefly a formation of a 
third-level canonic system for this particular second-level system. 
We remark first that when we specified the second-level canonic 
system, we set up a standard frame, independent of S; ‘(canons 


5,6,7,8,9,10.2,11,13,14,15, 16,17) to which G1 -dependent capops 


Sebo ely Bie eS - GES 
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were added: 1,-2, 3, 6, 10.1,.12.. The same precedure will 
be followed here. The third-level ( (i+1)* -1evel, iz2 ) 
canonic system may be constructed from the second-level 


(i? 1eve1) canonic system by the following algorithm: 


1. To obtain the G p-independent (-independent) 
canons, use the standard frame, but make the appropriate changes 
in notation, i.e. underline the predicates one additional time and add 
one more semicolon wherever the sign test (;4) occurs. 

2. To obtain the G ,-dependent: ( 6; -dependent) canons, 
use the members of these sets listed in the definition of the second- 
level (ith_1eve1) canonic system as the terms of the appropriate 
canons of the second-level canonic system and sedensine the 
predicates one additional time. 2 

Thus, the (i+1) "Pe reve1 canonic system can be constructed 
from the ith-jevel canonic system with a minimum of effort. Thus, 
it can be seen that all higher-level canonic systens have the same 
basic form. Since no level defines’ its own operations, each level 
is logically consistent. - a | 

For purposes of discussion, at some level the metalanguage of the 
level must be defined informally. It appears that the second level 
would be an appropriate level to do this. Recall that, for a 
given problem, the first-level canonic system defines the problem; 
the second-level canonic system defines the ceerstion of the 


first-level canonic system. All higher-level systems define the 
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operation of previous-level systems. Thus, by selecting the 
second-level to informally define the metalanguage, the first 
level canonic system (which defines the problem) is precisely 
defined and logically consistent. 

For the case when the "“object'' canonic system G, is not 
a simple canonic system, the following changes will have to 
be made in the Secondstevel canonic system | 6, formally 


specifying "the anatomy and physiology" of 6. 


1) 6.1-6.4 7.1 8.1 8.2 are unnecessary 


( 6.5-6.9 7.2 8.3 8.4 alone will do in this case); 


2) 13.1-13.2 should be replaced by 
(13.1) uy list of statements 3; v statenent feu bv 
of legal canon format . 
(13.2) u canons; u of legal canon format = 
u accepted canon 
3) Obviously, all the 61 -dependent canons of c. will 


be chosen so as to reflect the particular components of 


6. 


Suitable changes may be made to allow for predicates of 


higher degrees. Examples of canons allowed in general canonic systems 


are: 
x A f— ax A 
axby A L- xy B 


xby A - xcyd B 
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x number; ©x, book descriptor -- x year of copyright 
(x,y €M; '©', ‘'a', 'bt , and ', ' are in V) 

The sentence symbol (predicate) will be denoted by 'sentence' 
instead of ' }' (D = {sentence}). 

The alert reader has undoubtedly noticed another departure from 
the traditional terminology: our avoidance of the term "terminal 
alphabet". The set V has been called just plain "alphabet", The 
reason is that this set does not necessarily correspond to the 
terminal alphabet of a formal grammar; it may include auxiliary 
symbols.” In this connection, see also Chapter VI. 

Before we study the different hierarchies of canonic systems, 
we wish to mention several results of Haggerty and to point out one 


of their implications. 


predicate has degree greater than 1. ['Reduced" means that a state- 
ment is provable in the second canonic system iff it is provable in 
the first one.] 


* 

In constructing canonic systems to correspond to regular or to 
context-free grammars, Doyle took the terminal alphabet of 
the grammar to serve as alphabet of the canonic system, and the 
nonterminal alphabet to serve as set of predicates. When, however, 
he considered grammars of type Oorl, using a completely different 
approach, he correctly uscd, in fact, the union of the terminal 
alphabet and the nonterminal alphabet of the grammar to be the 
alphabet of the resulting canonic system, but he said he included 
only the terminals. If in his construction the alphabet is to 
include only the terminal symbols of the grammar, then his construction 
would not yield a canonic system at all, since some of the "canons" 
included are of the form fF A nonterminal, where A is neither 
a symbol nor a variable. Whenever we shall hereafter mention these 
constructions, we shall assume that the appropriate correction has 
been made. 
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[Proof by replacing n-tuplea < 7 * <*e* «8. > by temns of the form 
ymbo 1 


8,98, ooo Ss) » where $ is a new » to be"used as a separator.) 


Theoren.t3. Any canon using indicated context may be reduced to a 
n 


cano thout indicated context (in other words, any canonic system can 
be reduced to a simple canonic system). 


[Proof. Each constant word will be replaced by a variable whose value is 
specified (by an additional premise) to be in an [ edequately defined] 
singleton set.] 


gor =3- Any canonic system can be reduced to one in which each 
canon has a 8 gle premise. 


{The proof uses the following basic idea: a canon like 
. term, pred, ; term, pred, $ wee term, pred [-term pred 
is replaced by ; 
< term, , term, «+. ,tem, > pr i [tem pred ; 
t f th 
where Li ae is a new predicate whose degree is ne sum o e 
degrees of pred » and then additional canons are introduced for the 
newly-created prédicates.] 
We remark that, as a consequence of Theorem H-2, the class of simple 


nonic systems if the class of sets definable by them is not different 
from the class of sets defined by the nonti-penetal canonic systems. 
However, the real significance of this theorem is quite different: we 
tricted class of simpler canonic systems which still realizes the same 
computational power. An additional argument is thet Alsop's "canonic 
translator" [Alsop 1967} uses only "simple canons". Moreover, there is 
nothing to guarantee us that if we apply a certein restriction on the 
class of all canonic systems and on the class of simple canonic systems, 
the resulting classes have the same computationel. power, or that the 
image of the first restricted class under the cue edfae low OF Theorem 


H-2 is included in the second restricted class. 


SRE RE itt ie engiio ey oA et oe tad, 
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CHAPTER ITI 


A HIERARCHY OF GENERAL CANONIC SYSTEMS 


Canonic systems were first used in specifying the syntax of simulation 
languages [Donovan 1966], including the features which cannot be expressed 
in Backus~Naur Form. Since canonic systems, while designed to be more po- 
werful than BNF, were too powerful when first defined (having the full com- 

_ putational power of Turing machines and thus being able to define non-re- 
cursive sets), it was felt that restrictions have to be applied so as to 
render the resulting classes of canonic systems incapable of defining non- 
recursive sets yet powerful enough to specify the syntax of any programming 
language. (Experience and intuition have indicated to us that for most pro- 
gramming languages the set of legal programs ie recursive and it is only 
specialized features of languages such as those found in PL/1 which have 


enabled us to prove that the set of legal PL/1 programs is not recursive 


search and defined a partial hierarchy of canonic ayetada: trying to in- 
clude in it correspondents for Chomsky's 4 types of formal grammars. 
Doyle's hierarchy has two distinct parts. The first part includes two 
classes of canonic systems, one equivalent in strong generative power to 
regular gremmars and the other equivalent in strong generative power to 
ear ree grammars: 


» Pheoren_D=3 ["'3" for "Type 3"]. The class of right-linear canonic 
systems and the class of regular grammars are strongly equivalent. 


Theores De2. The class of normal-form two-premise canonic systems 
and the class of context-free grammars are strongly equivalent. 


There was a clear correspondence between the two formal systems, to each 
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production in the grammar corresponding a canon in the canonic system, and 
vice-versa. All the predicates occurring in the canonic system were of 
degree 1 (sets of strings), and the canonic systems turned out to be, 

in our terminology, simple canonic systems. in the second part of his 
hierarchy, Doyle allows predicates of degree 2 to occur (sets of pairs 
of strings) but no predicates of higher degrees, and obtains a class of 
canonic systems equipotent to Turing machines: for any grammar of Type 0 
there is a canonic system which generates the same language. In other 
words, 


Theorem D-O0. The class of canonic systems with predicates of degree 


2 is weakly equivalent to the class of Thue semisystems (grammars of 


Type 0). 


From the proof of this theorem we also have: 


Theorem D-Os. The class of simple canonic systems with predicates 


SSS SSS 


of degree 2 is weakly equivalent to the class of Thue semisystems. 


Doyle also mentions “noncontracting canonic systems with predicates 
of degree 2", and states that these canonic systems generate only recur- 
sive sets and that for any given context~sensitive grammar one can find 
a "noncontracting canonic system with predicates of degree 2" weakly 
equivalent to it. We have not listed this as a theorem since the defi- 
nition of 'noncontracting" is entirely inadequate, especially when pre- 
dicates of degrees 2 (and higher) are included, and therefore the 
above-mentioned class cannot be considered to be defined. In this con- 


nection, see also Chapter V. 


This completes the second part of the hierarchy. The one-to-one 
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correspondence between the productions of the formal grammars and the 


canons of the corresponding canonic systema, while present in the first 
part of the hierarchy, could not be established in the second part, | 
owing to the inherent difference between canons of these eieenes of | 
canonic systems and the productions of Tl or TO grammars. If we direct 
our attention to canonic systems which do take context into consideration 
(canonic systems with indicated context, which are here. called ‘general 
canonic systems! ), 4 natural solution presents iteeit which not only 

fills in the above-mentioned gaps but ied brings about strong equi- 
valences with all 4 types of formal grammars considered by Chomsky and 
with any type of grammar definable in terms of productions, thus embedding 
the theory of formal _greumars into t that of | canonic “systems. This simulation 
of formal grammars by appropriately restricted canonic systems with indi- 


cated context is the object of the present chapter. 


The following definitions are analogous to Chomsky's: 


if each of its canons, except for five of chem, is of one of the forms 


(1) xPAwyy derivable - rPopy : ie e 
(2) A nonterminal . | 
(3) a terminal 

where 


(a) ?’ y.» wW denote perticuler. strings,. possibly cvet ys 
tb) A isa douteraianl (i.e. there is- eceorrecvoedion canon. “ot. 
the form (2) ); and 


(c) for every symbol from the alphabet there is either 4 canon of 
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form (2), or a canon of fom (3) (but not both) , 
the five other canons being 
(4) | J, derivable (% €v) 
(5) | X% terminal string 
(6) x terminal |x terminal string (x,y €M) * 
(7) x terminal ; y terminal string [- xy terminal string 
(8) x derivable ; x terminal atring Fk x sentence ; 


We may dispense with the predicate ‘nonterminal' altogether, and 


replace the present requirement (c) with a new one, (c'): 


(c') Any symbol in the position of A ina canon of form (1) ** 


must not appear in a canon of form (3) . 


Since this modification will simplify the proof of. the main equivalence 


theorem, we shall adopt it. 


* The effect of applying Canon (6) in a derivation can be achieved 
by applying Canons (5) and (7). Canon (6) was retained in order to pre- 
serve the correctness of future references by formula number. 


** There are two ways in which a canon like 
xABCy derivable }-xaBacy derivable 
may be interpreted as a canon of form (1): . 
1) Y= A; Y= C; w= BA; the expanded letter is B 
2) Y= AB sy = 3 w = AC ; the expanded letter is C. 


(Of course, this is just one canon, not two, and the two interpretations 
have no influence on the use of this canon in derivations.) \:In such a 
case, only one of the symbols that may be considered as being "the expanded 
letter" is requied to be a nonterminal (i.e. to be missing from the canons 
of form (3) ). 
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system of type 0 satisfying the additional condition that in all its 
canons of the form (1) the string w is non-null. 
or context-free canonic system (CFCS) if it is a CSCS satisfying the addi- 


tional condition that in all its canons of form (1) the strings g.¥ are null. 


or regular _canonic. system if it is a CFCS satisfying the additional condition 
that in all its canons of form (1) the string w contains just two | 
symbols, one terminal and one nonterminal (one for which there exists a 

canon of the form (3) and one for which there is no such canon), always in 
the same order. If the order is "nonterminal - terminal’, the regular | 


canonic system is also called a ljeft-linear canonic system. 


ewww ew new ewe ee EE Bee Ee Oe 
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canonic systems. Obviously, all results obtained for these types of 


is_a_Type_i_grammar_which generates the same language, and conversely. in 


other words, the class of Type_i_grammars is equivalent to the class 
We shall show how one can pass from grammars to canonic systems and 
from canonic systems to grammars. Let there be given a grammar G = N, T, P, £) 


of Type i Gi = 0, 1, 2, 3). The associated canonic system has the canons (4), 
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(5), (6), (7), (8), one canon of the form (3) for each element of T, and for 
each production AW + pw oy one canon of the form (1). The resulting 
canonic system is, by construction, a canonic syetee of Type i (i = 0, 1, 2, 

3); the strings (p, » may be empty. Suppose now a canonic system of Type 

i is given; the corresponding cvaimayTe-detined in. the following manner. 
The set T includes all symbols for which there is a canon of type (3); 

N will include all other symbols and for each production there will 
be a canon of form (1). It is obvious that the resulting grammar is 
by construction of the same éype as the canonic system from which Lt was 
derived. 

Before we show how derivations are simulated, we should clarify 
what is meant by a derivation in formal grammars. Two definitions are in 
use in the theory of formal grammars, and our construction below works 
with either of them. According to the first definition, any sequence of 
at the last application; a string is accepted iff: 

a) it has a derivation; | 

b) it contains only terminal symbols. 
According to the second definition, a pequence of applications of productions 
possible. The grammar is usually required to have for each nonterminal 
symbol, at least one penduetsen expanding it, in which | case a derivation 
produces automatically a string of terminals (if there were a Rone ere! 
in the string, the senUence could be continued and therefore does not 
constitute a derivation) ; a string is accepted iff it has a derivation. 
We shall use the first definition, but we remark that if the grannar 


is required to have for each nonterminal symbol at least one production 
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expanding it, a derivation in the first sense (according to the first 
definition) is also a derivation in the second sense (i.e. cannot be 
continued) iff its last string contains only terminal symbols, meee 
the two concepts of acceptance coincide. 

Let us consider a derivation in the canonic system. We shall 
simulate the derivation in the canonic system, ina step-by-step manner, by 
a derivation in the formal grammar. Without loss of generality, we may assume 
that the derivation in the canonic system starts with the canon (4). The 
derivation in the formal grammar simulating it will start with the one- 
character string =. Any canon of the form (1) will be simulated by means 
of the corresponding production; canons of other forms will be disregarded 
for the moment. We have thus obtained a derivation in the formal grammar simu- 
lating step-by-step the given derivation in the canonic system. If, the 
last string obtained is not only derivable but also a sentence, then this 
string has been obtained by applications of canons (5), (6), (7), with a 
final application of canon (8). Thgapplicability of canon (8) proves 
that the second condition for acceptance in formal grammars (condition 'b)' 
of the first definition of derivation) is fulfilled, and therefore the 
string is accepted by the formal grammar. 

Therefore we have shown that for every derivation in the canonic 
system there is a derivation in the grammar. The converse result is proved 
similarly. This completes the proof. It is easily seen that what we have 
proved amounts to strong equivalence. We can therefore assert: 

(strong form of the general equivalence theorem) 


Theorem 1' The class of Type + grammars iy strongly equivalent to the 
class of Type + canonic systems, for t= 0, 1,52, 3._ The classes of ‘linear, 


onessided linear, metalinear, sequential, etc, yrismars are strongly equivalent, 
aespectively, to the classes"éf Itnear, ofe~sdded linear, netalinoar.. 
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system (1CSCS) if it is context-sensitive and in all its canons of type (1) 
the right context (the string ~) is empty. . [And similarly for rCSCS. ] 
These definitions are the natural counterparts of the definitions for left- 
context -sensitive, [ right ;context-sensitivd grammars. One-sided context- 
sensitive grammars have been studied but with no significant results to date. 
About all that is known is that they can generate non-CF languages (and cannot 
generate non-context-sensitive languages). It is conjectured that they 
cannot generate all context-sensitive languages. 
Another type of formal languages (left-tontext-sensitivd have 
been defined in Mandl 1968 and shown to be weakly equivalent in generative 
power to context-sensitive grammars. This gives rise to a new type of 
_ canonic systemsstrongly equivalent to left&context-sensitive grammars. 
These grammars seem to be new and interesting and therefore we will 
discuss these further here. 
The definition below was sugceuted by Booth's definition of 
context-sensitive grammars [Booth 1967] as a phrase-structure grammar all . 
of whose productions of any of the following three forms: 
(9), 


(30) t; At ou k 


(11) 0, Al, +0, wW by 
He further remarks that productions of the forms (9) and (10) are not really 
necessary (since they can be obtained by. adding a few rules of the form (11) 
and by adding a few déwnontetainil symbols) but they ‘inte his exposition 
easier to follow. Suppose now that the right cpabexts are null in all 


these rules (and similarly for left contexts). .Then the rules have the form 
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where the first and the third are left-context-sensitive, rules and the 
second is not. This second form of production will be the only form 
allowed in the grammars we are going to define. . . : 
Definition 8. A left-tcontext-sensitive gramar is a phrase-structure 
grammar all of whose productions except perhaps for a rule roa ; 
are noncontracting productions of the form 

(12) @A + G4 » A & N, pe V* = (NUT)*, w #A. 
{Similarly for right-*context-sensitive grammars] It may be remarked 
that this type of production is not a particular case of the general 
production pr w > Yw y as the left context -sensitive, rules were. 
Likewise, the corresponding type of canon is not a particular case of 
(1), and so we cannot (yet) define left-*context-sensitive, systems as 
a special case of Tl (or Type 0, for that matter) canonic systems (see 


footnote). We shall use instead a definition which is similar to 


Definition 11. 


Definition 9. A left-*,context-sensitive, canonic system is e 


canonic system which includes the particular cenons (4), (5), (6). (7), 


(8), a finite number of canons of the form 
(14) xepay derivable j- xWeY derivable 


and one canon of form (3) for each symbol A occurring in some canon 


(14). [Similarly for right-*,context-sensitive, canonic systems; (14) 


is replaced by (15) xAwy derivable = xyuy derivable J 


32 


Theorem 2. For any given context-sensitive grammar, there exists a | 


the same language and obtainable from the original one by a uniformly 


but it will be evident how to start the proof should one wish not to use 
the reductions. Definition [Kuroda] A context-sensitive grammar is 
of order n if there appears no string of length greater than n in any 
rule of the grammar. Lemma 1 [Kuroda] For sais context-sensitive grammar 
of order n (n > 3) there exists a context-sensitive grammar of 
order n-1 generating the same language. 
(By repeated use of Lemma 1:) 
Lemma 2.-[Kuroda] For any context-sensitive grammar there is a grammar 
of order 2 equivalent to it. 

Let G be the given grammar. By introducing new terminal symbols, 
we can convert it to an equivalent grammar in which terminal symbols appear 
only in rules of the form A+ a ("terminal rules"). 
Remark. [Kuroda] The original grammar might have been given in an 
apparently more general form* in which there might be a production which 
rewrites more than one syaboll: 

(16) a lw, | < 4 

w, € (TUN)* . N(TUN)* 


|w 
e 


*We can thus define two new types of canonic systems ("Types 1' and 0 "), 
with canons (4), (5), (6), (7), (8), canons of the form (3) (and (2)) and 
canons of the form a 

(17) XW derivable -- XWoY derivable 


where Ww, includes at least one nonterminal, with or withoyt the restriction 
|w,| < ||". Using Kuroda's remark and our general equivalence theorem, we 
can conclude that these types of canonic systems are weakly equivalent, 
respectively, to Tl and TO canonic systems. At this stage we could redefine 
left-*context-sensitive canonic systems as a certain special case of 

Tl (context-sensitive)-canonic systems. 


equivalent to the class of left-*context-sensitive. canonic systems. 
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Similar theorems hold for right-*context-sensitive, canonic 
systems and grammars. A further application of the general equivalence 


theorem yields: 


eorem 4. For any given context-sensitive canonic system there is 


strongly 
a=—ag 
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Figure 3 


Most of the equivalence theorems of this chapter are summarized 
in Figure 3. For completeness' sake, we also included several trivial 


results. 
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CHAPTER III 


SUBRECURSIVE_CLASSES OF CANONIC SYSTEMS 


Oe ee ee ae ee ee a OO ee ee 


Both these hierarchies of canonic systems, as well as the hierarchy 
of formal grammars, have no class of system to correspond to the class of 
recursive sets. (''Noncontracting canonic systemswith predicates of 
degree 2'' were claimed to be situated somewhere between context- 
sensitive sets and recursive sets, both inclusions being in the weak 
sense.) 

We state here in what sense(s) would a class of canonic systems 
(formal grammars, etc.) correspond to recursive sets and elucidate 
why no class of system has been found equivalent to recursive sets. 

“It is well-known that there can be no procedure for deciding 
whether an arbitrary recursively enumerable set is a member of a 
given non-empty collection of recursively enumerable sets, except 
in the trivial case when all the recursively enumerable sets are 
members of the collection. This is Rice's theorem; see, e.g., 
Rogers { 1967 , p. 324 (Th. 14-XIV (a) )]. Consequently, it is clear 
that we cannot hope to find a class of canonic systems which (a) defines 
all recursive sets, and only recursive sets, and (b) the class includes 


al] the canonic systems which define recursive sets. 


” [Mandl 1969] _ 
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We might hope that there exists a "small" class of canonic 
systems which define all and only recursive sets however realizing 
that the class cannot include all canonic systems which. define 
recursive sets. Or, stated in another way, it might be the case 
that a certain class of canonic systems (characterized by a finite 
set of properties, and such that is is decidable whether a given 
canonic system meets those properties), would corressond to the 
recursive sets in the sense that 

-only recursive sets are generated by canonic systems of 


that class (the class is "subrecursive"') ; 


(Ayre a ee See ere cr 


for every recursive set, there is among the canenic systems 
of that class at least one canonic system defining cas 
given recursive set band there may be such canonic. systems outside 
the considered class). 
We shall prove that such a class cannot exist, i.e. if a class 
of canonic systems defines only recursive sets, then it cannot define all 
recursive sets, even if it does not have a wonepely in defining recursive 
sets. This result can be restated succinctly as: "Subrecursive classes 
of canonic systems are strictly subrecursive."' | 
Theexee.g-: Nashass.of.canopic.systews. (ar bf any. fAnitely-specifiec 1 
formal systems, for that matter) can corre nd exactly (in the sense of 
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(a) ) above) to the class of recursive sets. In particular , CST] Soe 


{ Becursive_sets] *, 


*The reader may have noticed a similar statement, without proof, in Donovan 
and Doyle, 1968, p. 46; ''Thas, a noncontracting canonic system can only define 
a recursive set. However, it cannot define all recursive sets; some 

recursive sets can be generated only to a TO grammar.", An earlier work 
claimed to have proved this by exhibiting a concrete example, but the 

proof was eliminated when the example turned out to be a context-sensitive set. 
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Preof (based on an idea of Hopcroft and Ullman [1969 , §8.3] ). 


Since canonic systems are finitely specified, we can canonically 
enumerate all canonic systems, the canonical index encoding the 

whole description of the canonic system ("G6delization" of .canonic 
systems.) Likewise, we can canonically number (encode) all the words 
over the denumerably infinite list of potential symbols; let 

u, be the Kt) work in this numbering. Since it is assumed decidable 
whether a certain canonic system is of this type or ast we can strike 
out all the canonic systems pot.ofthis.txpe, thereby effectively 
enumerating all the canonic systems of the type considered: 


C ’ C 2? 6 got . By the hypothesis, all these canonic 


systems define recursive languages Dae £.. x. ee . Consider 


the set 


f~ |ugt, } 


It is different from all Page izil, 2, ... 3 yet it is recursive. 
Therefore no type of canonic systems can define all and only recursive sets. 
Remark. A recursion-theoretic argument yields Theorem 7 as an immediate 
consequence of the known theorem that the class (set) of all recursive 

sets [ while recursively enumerable as a class of r.e. sets [Blum 1965; 
Suzuki 1959]]Jis not characteristically inimevebie: Proof.of_ the reduction. 
For all subrecursive classes of canonic systems the proof of the subrecursive- 
ness has been done by exhibiting a decision procedure. In other words, 

if we have a finite description of a canonic system, we can interpret 


it not only as giving a procedure for enumerating a set but also as giving 
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a procedure for computing the characteristic function of the set, i.e. 
that we can find not only an r.e. index of the generated set but also 
a characteristic index thereof. Therefore [a description for] g 


cauonic_systew_belonuging_to_a_subrecursive_class_is_akip_to_a_cbaracteristic_ 


index_for_the_recursive_set_defined by that_canonic_system- 


*The elucidation of this point owes much to a discussion with Professor 
Patrick Fischer and Professor Juris Hartmanis at the Third Princeton 


Conference on Information Sciences and Systems in March 1969. 
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CHAPTER IV 


CANONIC SYSTEMS FOR CONTEXT-SENSITIVE SETS 


In Chapter II we mentioned Doyle's work onahierarchy of canonic sys-~ 
tems, where, inter alia, it was stated that the NCST were situated some- 
where between context-sensitive sets and recursive sets. Let us now take 
a closer look at the definition of NCST. It reads ("Definition 2.13"): 


"A noncontracting canonic system (NCCS) is a canonic sys- 
tem in which each application of a canon results in the length- 
ening of the string denoted by the predicate defined in the 
canon. That is, if A@P and w€A and to prove weEA it 
was first necessary to prove B @B , then Ju > lB] - That is, 
in a derivation, me we have 

eee 5 BBS -ee 5 WA 5 eee 
then fuf>bl. ¢ B may denote the same predicate as A ) 

A noncont ract ing canonic system with predicates of degree 
two (NCST) can be constructed to describe the language gener- 
ated by a Tl grammar; this canonic system has the same basic 
structure as the canonic system equivalent ot a TO grammar 
with the additional length restriction." 


Objections to the definition 


1. "the string denoted by the predicate defined in the canon" . The 
conclusion of a canon has only one statement , and therefore it involves 
exactly one predicate. However, this predicate is not necessarily of 
degree 1 , so we cannot refer to "the string". 

2. "lengthening" . That unspecified string is longer than something. 
Longer than what? The hypothesis of a canon may include many strings and 
many n-tuples (tuples) of strings. 

3. " fofafa[". If w and B are tuples, their length is unde- 


fined. If they are strings, then something has to be said about tuples, 


or at leastabout pairs, since predicates of degree 2 have to be allowed 


in order for Doyle's proof of [Type 1]@ [NCST] to work. 
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4. [Concerning the derivation] Although on p. 18 of that paper it 
was said "In this paper, a derivation will consist of a sequence of canons 
instead of the sequence of conclusions of these canons", here we have to 
revert to the original definition of derivation (as sequence of conclu- 
sions). When we do so, we see that an axiom may appear anywhere in this 
sequence, and it is not necessarily longer than all its predecessors (or 
shorter than other strings that may follow). Moreover, not only strings 
appear in a derivation but also tuples. 
5. "( B may denote the same predicate as A)" . B does not denote 
a predicate; rather, it is a predicate. Formally, predicates are and 
remain elements of P ; and when we write P = {a » BS ve also mean that 
A and B are different elements of P . We could have introduced meta- 
variables ranging on predicates, | Vi , V, bac » in much the same way in 
which we tacitly introduced B ,w, gy »W to stand for particular 
strings, and in that case we could have written | 
ooo 3 BUG eee gs w Us ; eug 
and said that the meta-variables v, and Vv, may denote either two 
distinct predicates A ,B or one and the same predicate A. Since we 
have not introduced such "predicate-variables", and since A , by defini- 
tion, is not the same as 8B , one should have said 
",... if we have fae 3 B 


3 
or we have eon 3 B 


then ful le] ," 


ee W 
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We therefore see that, at this stage, there is no such thing as non- 
“ contracting canonic systems with predicates of degree 2. Correspondingly, 


this chapter will be devoted not to proving something about the [undefined] 


and will be such that 


1] Doyle's claims will hold for it ( [Type 11@[new class] & [Rec] ); 
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As it very often happens in such cases, the real problem is not to 


prove but to "guess" what to prove (and to "improve a bad guess" by trial 
and error). . ‘ 

We cannot define ‘noncontracting' [nc] as "such that the sum of the 
lengths of all strings in the hypothesis (whether appearing ciated or a8 
elements of tuples) is at most as large as the sum of the lengths of all 
the strings in the conclusion" » since then a canon like 

x A; x Be x ¢ 

would not be noncontracting, which is not only Sounterdncultive but also 
does not allow us to salvage the proof for "[Type 1] © [NCST] ". For 
the sunt ening case when no predicate of degree | 2 appear in the conclu- 
sions of the canons, one could try | | 

“the string in the conclusion is no shorter than any of the 

strings appearing in the hypothesis, whether they constitute 

terms of degree 1 or are elements of higher-degree terms" ; 
We shall reconsider this suggestion later on (in a modified form); at 
the moment we have to abandon it because we plan to use as much ae pos- 
sible of the existing proof [ Donevan & Doyle 1968, pp. 43-44], and the 
canonic systems constructed in this proof are, as we noted in Chapter II, 


sions of the canons). 


Since the real problem here was the finding of of a good definition, 
we think it would be more instructive for the student of canonic systems 


if we try to present how the definition was arrived at, instead of just 


exhibiting it and showing that it works. 
' Doyle's proof of the recursiveness used a multitape Turing machine; 
the idea was to show that this machine always halts, thus deciding mem~ 


bership in Wb ) . We intend to prove more, viz. that the set us ) 
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defined by the canonic system is context-sensitive. For this, it will 


be enough to show that the multitape Turing fiachine which decides whether 
rr) eu) never uses more than | squares on any of its tapes. 

As our first step, we modify Doyle's Turing machine to have, in 
addition to one tape for each predicate of degree 1, also k tapes 
for each predicate ‘of degree k , for k= 2, 3, ... (all the tapes oe 
distinct). In Doyle's construction, the Turing machine exhaustively ge- 
nerated all strings of length < jo| in the language defined by the ca- 
nonic system.and checked for the occurrence of W on the tape assigned 
to the sentence predicate. Naturally, all strings, on all tapes, had to 
be placed one beside the other (separated by special characters), and so 


the storage space for far from being linear. One could achieve linearity 


instead of being appended (with a separator) to the current end of the 
tape. However, each string has to stay available indefinitely, for later 


use in derivations (Fig. 4). 
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Figure 4 
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More exactly, it has to stay indefinitely available in a11 cases EXCEPT 


when each canon has at most one premise (if 0 premises, the canon is an 
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Figure 5 


each statement on a computation path is used once immediately after being 
obtained and never needed again. This will be the main idea of our proof. 

In order to achieve thie situation we have to reduce our given cano- 
nic system 6 ("of Type X") to one G, in which canons have at most one 
premise and which is also"of Type X" . Forgetting for the moment of the . 
"Type X'" restriction, we notice that guch a reduction is always possible: 
this is one of Haggerty's results (Theorem H-3, here). There are exactly 
3 ways in which the canons of G ‘a are constructed: 


1) they may be inherited from (> , if they have at most one premise; 
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2) they may have been included in Gy t° replace some canon 


ty pred) Sede th pred | to pred 


of G ; general form: 
Cty toes ety pred pred, .-.pred [-«, 2zed, ’ 
where the degree of the newly introduced predicate is the sum of the de- 
grees of the n_ predicates in the hypothesis of the old canon; 
3) they may have been required by canons already in Gn : 
if G has canons 
Sty eet, 


t ' ' , far ° ° 
gt ett etl? s t' st » and R'S' is already inG,, 


F 
at 
Ke 


then it will also have the canon 


tl gosiat! > is p—<t t'> R's! : 


Stree Ps a o< re 


m * 


where deg(RS) = deg(R) + deg(S) , deg(R'S') = deg(R') + deg(S') . 


From here we get the final hint as to how to choose "Property X" : 
if we are to use the method of proof sketched above, "Property X" has to 
be invariated by '2Y,'3)'. '3)' suggests the following: 


PROPERTY xy - In each canon of the Canonic system: 
If the predicate in the conclusion is of degree k , then in each premise, 
separately, the tuple can be decomposed * into k parts (possibly empty), 
which are contiguous, mutually disjoint, and collectively exhaustive; and 
there is a permutation of these k parts such that, for every i ,4@i¢k , 
each element in the rag part ** always represents a string which is 


no longer than that represented by the has element of the term [of order 


k ] in the conclusion of the canon. 


* It is understood that no element of any tuple is to be cut in the 
middle by the decomposition. 


** The part which became the i) ster the application of the per- 


Spates oes 


Examples: <xey>A fez <¥3> AA 
<x.y>k f-o3, 22> BD 


As a particular case, we have: 


PROPERTY X, . [Same as X 


2 : es 
conclusion is. compared with those in the hypothesis: there is an integer m , 


m¢k , such that the — element of the conclusion always represents a 


but only one of the elements in the 


string longer than those represented by any élement ‘in any term in the 


hypothesis .] 
Example: _ €x 09> A «xy . a> B 


We shall now clarify what we mean | by the expression ‘always repre- 
sents a shorter string’ . When a canon is used in a derivation it dose 
not appear in its. general form but as a particular ‘canon_instance, in 
which all the variables are cea taced by particular strings. What Pro- 
perties X, ( is=1,2) require is that for each canon there be a 


decomposition of the kind specified above and such that eee all the 


canonic system * the above-mentioned decomposition yield particular 
strings which satisfy the length relationships specified in the defini- 


tion. 


* For example, if a _canonic system contains only the canons 
k— 3 digit 
f— 5 digit 


x digit }- x number | 

x sigit ; y mumber on xy numbe 
then 1535 digit tf 535 number number! is a legitinate {hateiicd of one of the: 
above canons, but can ‘never appear aa a derivation. We shall be ‘concerned 


here with canons like 


<*.y> greater in length ; y very long string } x yore lang atzing 


which are so decomnogable. because anv ENON Se At fandian tendemaw 


Thus in order to ascertain whether a certain c.e* has Property X, we 
have to make sure not only that the canons have certain forms but also that 
an infinity of canon instances satisfy certain restrictions. When we talk 
of classes of canonic systems we usually require that membership in the 
class be determined on the basis of a finite set of canons, not on the basis 
of an infinite set of canon instances; therefore we now proceed to define 
properties similar to Properties Xx, but such that they involve the canons | 
themselves rather than an infinity of canon instances. 

Let us consider first a term of degree 1 , e.g. xaby , where a , b 
are symbols and x , y are variables. Whatever the strings represented by 
x , y may be, the resulting string is always longer than the string repre- 
sented by xxyabb . We shall write: 

yx 4 xaby ¢ xxyabb 
Other examples: , 

x Qe 

x @ xy 

x Ss xb 


xx3y Pyx é 


We have to make one more preparatory digression before we formally define 
the relation PS - Since we want to use Doyle's eonserietion of a c.s. for 
a given context~sensitive grammar, let us have a closer look at that cons- 
truction. (We want to make sure that the definition of — will be chosen in 


such a way that the c.s. constructed will have Property x +) Its "most im- 


<abc .defg> greater in Jength ; defg verx long airing aes very long string 


» while legitimate as an instance, can never appear ina derivation in a ¢.s. 


which defines ‘xX. Y> greater in length’ to mean oo x is longer than y ". 


* "c,s." = "canonic system" . 


SP A A On pa a1 at 
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portant canon", and the only one which is likely to cause problems, is 


(1) wxz derived string ; x @ > production ;<y .xpgreater in length 
: : wyz derived string . 


The problem is that we need Wxz Qwyz » where x , y are not comparable 
(being two distinct variebles). All we want is that always the string re- 
presented by y be at least as long as that represented by x , and this 
is ensured by the premise <y,x>greater in lenth ( fy|pkkl). The defi- 
nition will include also this case, thus "legalizing" canon (1) - The pre- 


dicate greater in length used above is defined thus: 


(2) x terminal {- x symbol 
(3) x nonterminal - x symbol 


(4) <a .1> Length 
(5) <x y> length ; z symbol - <xz < yYi> length 
(6) <x. y> length ;<z.yl> length [<2 .x> greater in length * 


(7) <x .y> greater in length ;<y.z> greater in length [-<. Z> grea- 
ter in length 


(8) <x ~y> length ;<z y> length [- «x, z> greater in length 


Canonic systems which include the canons (2)...(8) will be called 
nons satisfy themselves the requirements placed upon canons of canonic sys- 
tems satisfying Properties Kis Ky (i.e. they are decomposable in the pre- 


scribed manner). 


* It is because of this canon that the c.s. which include canons (2)... 
(8) are not simple. The second element of a pair.in length represents 


the length of the first element expressed in l-ary notation: O='1', 3='1111', etc. 


SAR gts ETL EES ON 
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Definition 10. (Definition of < (with respect to a particular canon | 


in a particular canonic system)) 


ela. For any words a, B 
un <a ( % is the empty word) 
agp .#. als|p] 
Lb. If x is a variable, then 
oe 
xX &.X 
-lc. If a premise of the form <v.u> greater in length is 
included in the canon, where u,v are variables, then 


u<v (in that canon) 


If ty : ty Fs ty ; ty, represent concatenations of variables and words, 
2a, [Transitivity] t,t, - t,@t, - ed ti <t, 
2b, [Side-by-side concatenation of inequalities} 


t1<t, + t3<t, . SP t,t, < tot, 


-3. No relationship t, ¢t, is valid unless it is deduced from 


1 2 


a finite number of instances of .la. , .lb. , .lc. by means of a finite 


number of applications of .2a. , .2b. 


With the help of the relation << we are now in a position to define 


PROPERTIES Y for length-monitoring canonic systems.These properties 


1°? %2°? 
are defined in a similar manner to that in which we defined Properties x) ’ 
x, » but: 


1) the expression ‘element ty always represents a.string which is 


no longer than that represented by ty is replaced by ' ty = t, Ds 


2) the canons (2)...(8) , present in any length-monitoring c.s., 


2 REN TER RIS SBE Bert S I RPE NES BN OE ARN ac et SR RGR 
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are not required to be "decomposable" . [Notice the formal chenge in the 


concept of "decomposability".] [We shell later consider other types of length- 


monitoring c.s., in which case '2)' will refer to the canons there used for 
monitoring length.] 


We note that Property Y, implies Property X, “( i121, 2), and that 
one can immediately’ tell, by inspection, whether a c.s. hes Property yy 
( 1 =1, 2.) or not (this was not the case for Property X, » Property X, ys 


This latter fact justifies the following definition: 


pectively Yy ) 1£ it has the Property ¥, (respectively Y ) . [The name 


'type' is reserved for properties detectable by inspection.] 


a) Given any context-sensitive grammar, one_can_uniformly effectively 


construct_a length-monitoring canonic_ system of Type Y,__(_¥,_)__defining 


b) For_any_length-monitoring canonic system of Type _¥)__( Yo_)_2 the 


een mnanunbhnaae fwuaadwe oh wea wa 


aera wrenwin mas Snaananeawanea 


language defined by it ig context-sensitive (and a_suitable grammar can_be 


constructed in a uniformly effective manner) . 


Proof. ‘Since Type XY, implies Type Y, , it is enough to prove ‘a)' 


t t . 
for Yy and 'b)' for Yy : 


{Type 1] & [Type Y,] Se [Type Y,] Se [Type 1) 
a bb marr 


["the class of languages for which there is grammar of Type 1 is included. 


in the class of languges defined by c.s. of Type Y. , which ..!', etc.] , 


a) All we have to show is that the length-monitoring c.s. constructed 


in Donovan & Doyle 1968 pp. 43-44 always satisfies Property Y, , and this 
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is ensured by the manner in which we chose our definitions. 
(Remarks. There is no need to first reduce the. ana to one of order 2; 
~ the alphabet of the c.s. includes not only the terminals but also 
the nonterminals, and £ is included among the latter; 
= enero te ao need fox the canonic sytem. ttaelt to cerine. the: cone 
cept ‘string’, since this concept is part of the definition 
of canonic systems in general; 


- for formal reasons, the canon 


[ y string ;] <Cyy> production ; <y,i> greater in length 
fk- y derived string 

is replaced by the two canons [5 initial string and 

x initial string ; <x ¢y> production ; <y_x> greater in 


length -- y derived string , 
where initial string is a new, singleton predicate. ] 


b) Applying Theorem H=3 x, we reduce the given c.s. of Type 2 to 
one in which no canon has more than one premise.. Since the original c.s. 
had Property X, , and since this property is invariated by the construction 
in Theorem H-3 , the resulting c.s. also has Property xX, .. We shall now 
eeau trace (in a uniformly effective way) a nondeterministic multitape LBA 
which recognizes the language defined by the reduced c.s. Awhich is the same 
‘as that defined by the original one). For each predicate of degree k (k= 
1,2, ...), the LBA will have k tapes. Since each hypothesis has only 


one canon, the derivations have a certain "Markovian" character (see Fig. 5). 


Each statement obtained in the derivation is used in the fmmediately following 


* I am grateful to Amitava Bagchi for the suggestion to use Theorem 


‘H-3 in this proof. 
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the tapes corresponding to a predicate when this predicate reappears in a de- 


rivation. The LBA will simulate nondeterministically the derivation and will 
halt when a sentence is derived; if a string w is a sentence then there 
is a computation path of the LBA which halts with w displayed on the 
sentence tape, and conversely. The last step in the derivation of w is 
of the form | 


< abe coegh > AB.. .M aa (A) sentence 


by Property x, we have 


Jol & lal 
Jol B lel 
Jol B ul ' 


Tracing back our derivation, we see that, in view of the Property Xi 
“ts is at least as long as any string in the derivation, and therefore | w| 
is an upper bound, on each tape separately, on the amount of space necessary 
for recognition. 
The proof will now be coneladed ‘by replacing the multitape LBA by a 
{"multitrack"] one-tape LBA and noting that each step in the chain of cons- 


tructions 


c.s. of c.8. with context- . 


; muititape 
Type Y, =? one-premise =~ LBA ==> LBA => sensitive 


canons : grammar 


is uniformly effective. 


As an illustration to this proof, we now show how the multitape LBA 
would handle the canonic system which was chosen by Haggerty to illustrate 


hia procedure. 


A 


Derivation for ‘aabbbcaabbb' ; bB; <a 
<c aabbb > CD ; aabbbcaabbb E ; 


y> ac | <xe ye ac 


cz> ABC | 


cz > ABC 
© 7 ABC 
cz > ABC 
o> ABC 
c Y? ABC 


ce > ABC 


Is 18 ls Is 


bb > AB : 


< aa 


bbb > AB ; 


The multitape LBA has 16 tapes (=5-14+4°2+1°3) . The following figure 
(Figure 6) shows the contents of these tapes at successive stages of the 


simulated derivation. 


Lf +4 

b- bB 

Kt cs 

4 -- axa 

xB f- bxB 

CR exsg 

xA 3; yB kK xyD 

& ; yD ft wv E 
Derivation for 'aabbbcaabbb' “b B 


aabbb D ; aabbbcaabbb E ; 


Transformed canonic system: 


sea e ens ezreoaannwes wesw a= 


o & 
la {ow [> 


la Ie [> 
la [oy [> 


<xX_ y>AB |- xy 
<x yofD [|-yxy 


Ie io 


}e<a. i 


z> ABC feccz 


b< < 


qty 


A 
tad 
Aa 


VA 
> 
i) 


“wo 
[' 


¥ 
e 


by> 


b> 


by> 
b> 


< *Y> 


xy> 
ay 


. (The decompositions are shown by 
suitable underlining) 


les Ve le 18 


cD 


Is | 


if 


{The arrows mean ‘longer than’ .] 


fos 
L~a 


aabbbcaabbb 


is 


f BC (not used) A_l6etape LBA simudates a derivation in a canonic_ system 
he AC (not used) 
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CHAPTER V 


FURTHER HIERARCHIES OF CANONIC SYSTEMS 


The purpose of this chapter is to apply the main result of Chapter IV 
toward the development of imp roved hierarchies of canonic systems. 

Let us consider Doyle's hierarchy again. This hierarchy has two se- 
parate parts, one part comprising classes of canonic systems strongly 
equivalent to the class of regular grammars and the class of context- 
free grammars, and the other part comprising a class of canonic systems 
weakly equivalent to the class of unrestricted rewriting systems (Thue 
semisystems). The hierarchy was claimed to include another class of ca- 
nonic systems, situated somewhere between context~sensitive grammars and 
recursive sets, but we have seen in Chapter IV thet this class was not 
completely defined. In the same chapter, two classes of canonic systems, 
Pes Senge cmon sco say canontc veyacemscot Type. ta. 6 to) s-Mere Reever te 
be weakly equivalent to the class of context-sensitive languages. There- 
fore if we add any of them to the two parts of Doyle's hierarchy we obtain 
a complete Hisrarchy_of Canonic Sys tens: where by ‘complete we mean only 
that all 4 types of grammars are represented. (The hierarchy presented in 
(Chapter II had correspondents not only for the 4 classic types of formal 
grammars but also for any class definable in terms of productions.) 

While completeness is certainly a very desirable property, we cannot 


consider ourselves satisfied with it and ignore the fact that this com- 


bined hierarchy is quite heterogenous: for Types 3 and 2 it provides 
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simple canonic systems with predicates of degree 1; for Type 0 - 
simple canonic systems with predicates of degree 2 ; and for Type 1 
the canonic systems are not even simple. The form of the hierarchy may 
be schematically summarized as 
Sl $1 G2 $2 
(for Types: 32 1 0) 
Our first step toward "homogenization" will be to reduce the third 
. class from G2 to S2 . Clearly, we can always reduce a general c.s. 
to a simple one by using Theorem H-2 , but we need a class of simple 
c.s., weakly equivalent to context-sensitive grammars, and the property 
‘obtainable from class A by eliminating contextual references! is not 
a good criterion for class membership, since a criterion should refer to 
the form of the new system, irrespective of how the c.s. was obtained. 
We have seen that the length-monitoring c.s. cannot be simple, by defi- 


nition, since they all include the offending canon 


<x. y> length 3<z yl> length }- <z <*> greater in length 7 
If we modify IV.(2)...(8) by replacing this canon by the canons 


: <x. y> length 3 <2. yu> length; u unit F<. x> preater in length 
- lL unit [singleton predicate] 


and call the canonic systems which include (1) and IV.(2)...(5),(7)... 


(8) s-length-monitoring canonic systems, we can build for them a theory 


Definition 12. A simple s-monitoring canonic system is of Type__¥, 


(respectively Yo ) aif it has Property Y, ( Yy ). Property Y ( Y, ) 


SUSE D SS SERGE ca IES Re SRA Ri RR ia OREM EY os iSync aC oe wien ? eer Rae peer ae Sie . 
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for s-length-monitoring canonic systems is defined in a similar menner as 


for length-monitoring canonic systema, but the condition '2}' in that de- 


canons used here for monitoring length. 


Theorem 7 


a) Given any context-sensitive grammar, one can uniformly effectively 


ewaennonan han waenzean on on on oe et eune 


construct_a simple s-length-monitoring canonic system of Type Y,__( ¥,_) 


b) For_any_simple_e-length-monitoring_cenonic system of Type 


For any sim zie tori of | we ty 

Y 
‘as 
uniformly effectively find a grammar for it). 


)_2 the language defined by it is context-sensitive (and one can 


Proof. a) The only contextual referencing in the canonic systems of 
Theorem 6a was in Canon IV.(6) . If we replace that canon by (1) we 
get a canonic system which is simple, s-length-monitoring, of Type Yo 

(and therefore also Y ) , and defines the same language. 
/ b) Completely similar to the steak of Theorem 6b . [Theorem 7b is 


not a particukr case of Theorem 6b since s-length-monitoring c.s. are, 


formally, not the same as length-monitoring c.s.] 


We have thus obtained a hierarchy of the form 
$1 $1 S82. §2 ;: 


i.e. a hierarchy of simple canonic systems (of which the last class 
contains all the simple c.s. with oredicates of degree 2), and we shall 


try to reduce it to the form 


S1 Sl $1 Si oe 
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The last class can easily be so reduced. For any r.e. set there is 


a simple c.s. with predicates of degree 2 which defines the given set, 
and this c.s. may be reduced to one with predicates of degree 1 (by The- 
orem H-1) while remaining simple; and the converse. result. is certainly 
true, since sets defined by canonic systems are always recursively enu- 
merable. 


The hierarchy has now the form 
S1 Sl S2  §S1 . 


‘Unfortunately, salad en H-1 appears to be of no further use in reducing ‘the 
form of the hierarchy, ‘since none of che. 4 classes mentioned in this 
chapter as s being weakly equivelent to context-sensitive grammars. 
(length-monitoring canonic asec ees of Type Yy / of ‘T90e BS ae 

simple s-length-monitoring c.s. of Type ¥, 4..0£ Type Y ) 

is invariant under the transformation involved.in the proof of Theorem 
H-1. 

Having thus arrived at an apparent ''dead end" in our endeavors to 
develop and simplify Doyle's hierarchy, we now. consider the other basic 
hierarchy, the hierarchy of general c.s. with predicates. of degree 1 
(of the form Gl Gl. Gl Gl oe he ee ues ) 
which was introduced in Chapter II, and apply to it Theorem H-2. 

It is easily seen that we obtain indeed 4 types of canonic systems, 
i.e. valid criteria can be stated Giscapiine only on the form of the 
transformed canonic system) for meabarabis sts ¢.s. in a type. These 
types of c.8. may also be introduced independentiy. ~The following de- 


finitions are analogous to Definitions a 


TA LASTER AT 


of its canons, except for 4 of them, is of one of the forms 


(2) xuy derivable ; uU; v V | xvy derivable 

(3) t- pM (x, y, u, v are variables) 
(4) [+ «4 temninal 

where 


(a) wu (a meta-variable) stands for a particular string; 

(b) for any predicate appearing in a canon of form (2) , except 
for the canon derivable , there is exactly one canon of form (3) , 
i.es U,V,M are singleton predicates; 


U,V (in this order) are two singleton predicates appear- 


(c) if 
ing in a canon of form (2) , and if » , YW are the corresponding strings, 


_ then wp and V can jointly be put in the form 


p= Day 
v= Gu Y 


where ¢ ; y >» W are [meta-variables standing for] perticular strings, 


possibly empty, and A does not appear in a canon of form (4) 7 


the 4 other canons being: 


(5) +S derivable 
(6) f-» terminal string 
(7) x terminal ; y terminal string F xy terminal string 


(8) x derivable ; x terminal string f- sentence : 


of Type 068) and satisfies the additional condition that for each canon 


of form (2) the corresponding string w (defined in (c) ) is non-null. 


Definition 15. A simple canoni¢e system is of Type 2°°) if it is 
of Type 165) and satisfies the additional condition that for each canon 
of form (2) the corresponding strings > y (defined in (c) ) are 


null. 


of Type 268) and satisfies the additional condition that for each canon 
of form (2) the corresponding string w contains just two symbols, 
one terminal and one nonterninal (one for which there is a canon of form 


(4) and one for which there is no such canon), always in the same order. 


“sequential, left-,context-sensitive,, etc., simple canonic systems. 


Theorem 8 [Analogous to Theorem 1] . For any simple canonic system 


of Type i6®) (i= 0,1, 2,3.) there is a granmar of Type i which 


Proof. Similar to that of Theorem 1. 


The second part of Theorem 8 (the converse result) can be proved 


, 
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more easily if we use Theorem 1 and the following obvious Lemma: 


Lemma. The result of applying the procedure of Theorem H-2 upon 
a canonic system of Type i ( i=0, 1, 2, 3) is a simple canonic sys- 


tem of Type 168) . 


Theorem 8 provides us with a hierarchy of simple canonic systems 


with predicates of degree 1, that is a hierarchy of the form 
Sl sil $1 Sl ’ 


and the goal of the present chapter is thereby completely achieved. 
Before concluding this chpter, however, we should like to point out an 
interesting fact which provides a link between the two basic hierarchies 
developed in this chapter (the one of the form Sl Sl G2 82 
-- based on Doyle's -- and the other of the form G1 Gi Gl Gl , 
introduced in Chapter II). When we wanted to reduce the first basic 
hierarchy to one composed exclusively of simple canonic systems and no- 
ticed that its third class, the length-monitoring c.s. of Type Y » failed 
to be simple only because one of the canons used in monitoring string 
lengths included contextual referencing, we just replaced the offending 
canon. But there is absolutely no need for a canonic system to monitor | 
itself ‘the lengths of the strings. A context~sensitive grammar does not 
monitor the lengths of its strings, and it is no less noncontracting be- 
cause of this; strings grow in length not because the grammar monitors 
their lengths (which it does not) but just because the productions are 
noncontracting. When we examine the grammar ‘from the outside" (by 
using a meta-system) we.can prove that the strings are bound to grow; 


but there is no need to duplicate this proof inside the object system (the 
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grammar - or the: canonic system). We therefore: eliminate the canons 


Iv.(2)...(8), and the canonic system becomes now simple. It still con- 


tains canons of the form ‘ 


- < gAy - puy> production 
(one for each production PAyoqey ; Donovan & Doyle 1968, p. 43), 
and we just know that in each such canon fofer . This, however, does 
not yet solve our problem. We have to redefine the concept ‘c.s. of Type 
Y,' » OY, more exactly, to redefine the relation th 5; and this relation 


has to hold, sometimes, between two different variables, as for example, 


in 


(9) waz derived string 3<X, yo production; <y x>greater in length fwyz de- 


where we ought to be able to prove that xg@y .. For length-monitoring | 


¢.8. we could say that xqy because the premise < yx» greater in’ length 


is present (Definition 10.lc), but we do not have the predicate greater 


in length any more, and we are still under the obligation to ascertain, 


member of the class we define. One way to solve the problem.of eliminat- 


need to ever compare (in length) two distinct wariables. Then @would be- 
come an absolute relation, not devendent o& the canonic system, and defined 
by .la.lb.2a.2b.3. of Definition 10 (i.e. without. .ic..).. To achieve 
this end we have to replace canon (9) by as. many canons as there are pro- 
ductions, each new canon being the result of "plugging in'' a particular 


production in the canon (9) : 


(10) wed z derived string J wepuyz derived string 


63 


The class of canonic systems of Type Y (from the first basic hierar- 


1 
chy) is thereby transformed into a class which is, essentially, no 
different from the class of canonic systems of Type 1 (from the second 


basic hierarchy; Definition 3), and from here the whole second basic 


hierarchy is just one small step away. 


a a 
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CHAPTER VI 
NON- CROSS REFERENCING, SIMPLE, AND. NON. INSERTING CANONLC. : SYSTEMS. 


| CLASSIFICATION OF CANONIC SYSTEMS 


In this chapter we pursue an idea mentioned in Chapter I -- that 
one should not distinguish (and name) the subclass of canonic systems 
with contextual referencing, with insertions, with crossreferencing, 
the respective options. Canonic systems without contextual referencing 
(simple c.s.) were extensively studied in Chapters I and V; we shall now 
formally introduce the other two classes and investigate diiatg coments: 
tional power. 

Crossreferencing was defined [Donovan & Doyle 1968, p. 27] as con- 
sisting of the use of one and the same variable more than once in the 
term of the conclusion or the use of one and the same variable in more 
than one premise in the hypothesis. The possibility of a variable being 
used in exactly one premise of the hypothesis but decaueiis several times 
in that preaiae is he included in this definition. On the other hand, 
there is a fundamental difference between multiple occurrences in the 
hypothesis part of the canon and multiple occurrences in the conclusion. 
The applicability of a canon in a particular situation has to be esta~- 
blished before the canon could be used, and the applicability depends 
only on the hypothesis of the canon; if the hypothesis contains two 


occurrences of a variable, we have to check that the strings matched by 
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the two occurrences are identical ‘strings (substrings) , and this. checking 
is not an elementary action. Multiple dccurrences in the conclusion, 
however, have no influence sa the applicability of the canon. This argu- 
ment suggests that we should specifically exclude from the definition of - 


crossreferencing multiple occurrences in the conclusion, and include. 


The same point of view is taken by Turing (in connection with Turing 
machines) and by Minsky (in connection with Post's canonical systems). 


Quoting from Turing 1936 [p. 137 in Davis's collection] : 


"Tf, on the other hand, [the squares] are marked by a se- 
quence of symbols, we cannot regard thé process of recognition 
as a simple process. This is a fundamental point and should 
be illustrated. In most mathematical psepers the equations and 
theorems are numbered. ... But if the paper was very long, we 
might reach Theorem 157767733443477; thén, further on in the 
paper, we might find '... hence (applying Theorem 1577677334- 

3477) we have ...' . In order te mike: guee which was the re- 
levant theorem we should have to compare the two numbers figure 
by figure, possibly ticking the figures off in- ‘pencil to make. 
sure of their not being counted twice." . 


Minsky [1967, p. 231] remarks that he could have allowed multiple occur- 


rences of variables within any premise, but chose not to: 


"Post's most general formulation allowed each production to 
have several antecedents. ... Also in Post's most general for- 
mulation, he allowad two of the $*s in the antecédent to be the 
same. This meant that the rule of inference would apply only 
to a string (theorem) in witch there wisdn exact repetition 
of some (variable) substring in two places in the antecedent. 
We prefer to prohibit antecedents of this form, not because we 
want to restrict the generality of the systems, but because it 
would run counter to our intuitive picture of what ought to be ‘ 
permitted as elementary, unitary operations." , 


‘With this motivation (and backing) we change the definition of ‘crossrefe- 


rencing’ to read: 
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Detinitton 18; A canon 1s said to contain. crosereferencing if at. idast 


one of the variables involved in it occurs more than once in the hypothesis 
of the canon, whether these occurrences are within one premise or are in 


different premises. 


Definition 19. A canonic system is non-crossreferencing if none of 


its canons contains crossreferencing. 


Let us consider now the phenomenon of insertion, whose definition is 


implicit in the traditional definition of canonic systems with insertion 


as canonic systems in which terminal symbols are inserted between the 
variables of one string to form a new string. Since we:are interested in 
canonic systems without. insertions, we tentatively define canons without 
insertion as canons in whose conclusion no ijabele appedk: i.e. whose 
conclusions contain concatenations of wariables eather than concatenations 
of variables and words. The formal wedi cicatiane required in the defining 
second-level canonic system are not difficult to figure out, but the defi- 
nition would be forbiddingly restrictive: the axioms would be totally 
useless. In fact, we never defined axioms formally, but just referred by 


this name to any canon whose list of premises was empty, and therefore any 


restriction on the canons is automatically a restriction on the axioms. 
This suggests the following definition: 

Definition 20. A canonic system is non- inserting it it has the pro- 
perty that in all its canons, except for the axioms, the term in the 
conclusion of.the canon has only "pure" elements, i.e. each element is 


either a concatenation of variables or a concatenation of symbols. 


The following canonic systems will be used as examples: 


x theorem }- (x) 


x theorem |- 


ERED ith lat i pI oe, SAS a Gps ote 


Lo 


x()y theorem od xy 


1’. 
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Languege: set of balanced (well-formed) strings of 


ve{c, >} 


theorem 


theorem 


xx- theo rem 


theorem 


parentheses 


{Minsky p. 230] 


one- predica te ( Post) 
: nowcrossreferencing 


Same language, same alphabet. Lacenies p. 230] 


b O 


thegrem 


xy theorem fe «OY theorem 


“ simple 
one~predicate 
non-crossreferencing 


2. Language: palindromes over a , b,c . {Minsky p. 228] 


2', 


a 4 MO 


[> [> [> 


as 
E> eek 
{— aa A 
L- bb A 
}- cca 
[- axa A 
-— bxb A 
f- cxe A 


simple 
. one predicate 
‘non-crossreferencing 


- Same language. [Minsky p. 228] 


[> [> [> |> 


- a 
= b 
ec 
oes 
fbx 
— cxc 
[xx 


[> [> [> [> [> [> [> 


simple 


one- predicate 


non-crossreferencing 
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3. Language: all true statemants about adding l-ary positive integers 


{Minsky p. 229] (3 = '111' , etc.) 


v= fi,+,-} 


b-iti=1 add 
xtynz add Foxityesi add one-predicate 
xty=z add -- xtyl=zl add ‘pon-crossreferencing 


[or: xty=z add f-ytx=z add ] 
inserting 
not simple 
4. Language: all true statements about multiplying l-ary positive 


integers [Minsky p. 229] 


ve{i.-. =} 


t 1-i=l mult | one-predicate 
x-y=z mult [-xl-yezy mult non-crossreferencing 
X-y=z mult -y-x=z mult . inserting 
not simple 


5. Language: ‘Gas | m , . natural numbers } 


b-a A 
Pe 4 simple 
x A b ax A | non-erossreferencing 
x B L bx B 
x A; y Bh-xyxy sentence inserting 


5'. Same language. 


faa 


Fb B simple 
A;y A t- xy A. non- inserting 
B;y¥B |-xy B non-croasreferencing 
A;sy B 


f- xyxy sentence 


ERS AS ORR BB FOE LR Bieler A th 


6. Language: squares in t-ary. 69 


Alphabet = {1 , *f 


ft i A ) 
= non-crossreferencing 
x*y A [-- xll*yx A . 
; not simple 
xy A | ¥ gavare inserting 


6'. Same language. 
[Mentioned here for completeness; will be introduced later.] 


5'', Language: same as 5 . 


simple 
Fb ar non- insert ing 
x Aj; y Abry A . 
xBsy B xy B 
crossreferencing 
x A; y Bs 2 B - xyxz ABA 
Ae yy Bs B - xzyz BAB 
x ABA; x BAB t— x sentence 
5'". - Same language. 
faa 
x A fb oxA 
ae simple 
x B f~ xB 
ABA aaa bee inserting 
BAB crossreferencing 


x ABA ; x BAB - x sentence 


5! Language: Pa ara oe 


The first 6 canons of 5". 


Last canon replaced by ~ 


ax ABA ; x BAB’ x_ sentence 


5’. Same language. 


The first 4 canons of 5! . 


Last canon replaced by 


x. A 5 y B ° xyxy ABAB 


ax ABAB x sentence 
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natural numbers 


non- inserting 


_ not simple 


crossreferencing 


non-inserting 
non-crossreferencing 


not simple 


TLRS 
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of the classes of canonic systems considered catil Gow, only the 
two classes which correspond, in the firet basic hierarchy, to regular 
grammars and to context-free areas see non-crossreferencing. Since 
they are also simple and non- inserting, this implies that non-inserting 
c.8., non-crossreferencing c.s., and simple non-crossreferencing non- insert- 
ing c.s. are all powerful enough to define any context+free language. 
The following figure shows the new seabee 168 cuohale ayeteas (the numbers 


refer to our examples): 


Classification of canonic sys 


Introducing the abbreviations 72 
Q = non-crossreferencing ¢-8. 
R = non-inserting c.s. | 


S = simple c.s. 


QR = QfR 
Qk = Q\R etc. 
gee = (QA8)\s 
we. have 
[QRS] sg [CF] . (since (5) defines a non-, context- 
free, language ) 
| { [aR] =p {cr} 
aad [98] 2 { cr] As before, {class of eie.). = class of 


languages definable by the cless of. 
[85] => [finite intetsections of ‘CF languages] 
(ainee they. cag be obtained by cross- 


referencing; we note that the lan- 
guage of (3!) as ‘included in this 


Class) 
[Qh pe fcr) 
{R] wm [finite intersections of CF] [meeole improved 
{s] == {r.e.} . "he 


Since a c.s. in QRS can be trivielly modified so as to belong to 
Ors or QES or QRS , we also have : 


[QR] sp [cr] 
[QBs] ge (cr) 


Similarly, 


finite 
(Ga) s> [intersections of cH 
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[QRS] qe [CF] 


We shall now show that the non-inserting c.s. are powerful enough: 
to define any r.e. set: {[R]) =[r.e.] . As for the non-crossreferen- ’ 
cing c.s., we have not been able to improve the result stated above 
¢€ [Q) 2 [CF] ). It is known that non-crossreferencing c.s. with one 
predicate of order 1 cannot define the set of squares in l~ary (see 


[Minsky 1967, p. 235]). 


Proof. Any word (sequence of symbols) in the conclusion is replaced 
by a variable whose value is specified (by an addtional ptemise) to be 
in an (adequately defined) singleton prédicate. This implies that the 

‘desired reduction is possible. . | 


Remark. This procedure invariates the class of non-crossreferencing 


We shall now develop a complete hierarchy of non-inserting c.s.. 
Since the first two classes <com the first basic hierarchy (of the form 
Sl Si G2 S82) are already non- inserting, we shall bateds them and 
. adapt the last two to our purposes. . | | 
The most general non-inserting c.s. obviously generates an r.e. set; 


fining it (by Theorem 9). This gives us a class corresponding to Type 0. 


co ee orp eT: meee ee = 
TET Re TS ee RES Ses a SE SM STE ES RGSS SPE EST sel RSA a ROE 
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As for Type 1, we have a result completely similar to Theorem 7 (Ch. V). 
Before stating it, we need a few definitions. We would Like to talk 
about non-inserting length-monitoring c.s.; but all length-monitoring 


c.s. (as previously defined) include the [inserting] canon 


Iv.(5) <x. y> length ; 2 symbol xz. yl> length ‘ 


[A similar situation was encountered just before Definition 12.] By re- 


' placing this canon by 


<x, y> length ; z symbol; ui unit |-<xz _yu> length 
1 unit [singleton predicate] 


in the definition of 'length-monitoring' we arrive at the definition 


of r-length monitoring | cos. (similar to s-length-monitoring c.s.). 
Similarly, if we perform this replacement in the definition of 


'g-length-monitoring' , we arrive at the definition of re-length-monitoring 


canonic_ eyeteie. 


Theorep10- 


a) Given any context-sensitive grammar, one can uniformly effectively 


ROSH SHS SPSS ST SHEBRTSBH BESTEST TA TP Sewme SOSH KSO BBE Erenses ena awn 


construct a _non-inserting r-length-monitoring c.s. of ~2yPe_. X,__¢ Y, ) 


enna ae com wma nem me mwa othe cheater ed eee Lewawen 


defining - the s same languge. 


b) For any non-inserting r-length-monitoring c.8. of Type Y, ( YX, ) 


EO DD DD OP OD PN SF EE AY © OD AP OD OF OE FE EP ED ED OD OD OY ED GP ED OD HP OOH BY OE GF BP SP OD OD.) WE GD aD OE am OH ab aD a aD oe UD ee ae ae Jay OP oe ee 


the language defined by it is context-sensitive. Seine one can uniformly 


POs OS A OD OD. GP EY O-PS GD PAD. SP OD OD GD OD Om OD OD aS OD oe oe aD oD a DO OF GF EH OF Om FF OR SF Oe On OnE Or GD OR GD OE ON GP wD OD OD Oe GE ED oD oD 


effectively find a granmar_ for £2: 


* Definition similar to Def. 12. 
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We have thus obtained a hierarchy of non-inserting canonic systems. 


The form of this hierarchy may be described as 


RL Rl. R2 


R2 
+ 
RI 


(More exactly, RS1 RS1 RG2 RSI .) 


We shall npw define pure canonic systems. 
and non-inserting. {"pure'' since all concatenations contain either 


exclusively symbols or exclusively variables.] _ 
A -comptoce hiererchy of ‘pure c- 8. “y.0f 5he orm 


RSL RSl RS2~ = RS2 


RS1 ’ 


can be easily obtained, in a manner entirely similar to that in which 
the hierarchy of non-inserting c.s. was obtained. However, we prefer to 
present another hierarchy, based on the second basic hierarchy (form: 
Gl Gl Gl Gl ). At the end of Chapter IV, we found a hierarchy of 
simple c.s. with predicates of degree 1 : Sl Sl S81 Sl . 

(Cf. Def. 13, 14, 15, 16 and Theorem 8.) By inspecting the definitions 
of the classes of simple c.s. involved, it is easily seen that these 


canonic systems are also non-inserting. Therefore we have obtained: 
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Sl Sl Sl Sl 


o£ Theorem 8 is, actually, a hierarchy of pure canonic systems, 


4 


RSL RS1 RS1 RSL. & oui a 


The next logical step would be to look for a hierarchy of the form 
QRS1 QRS1 QRS1 QRS1 . We suspect that such a result is impossible to 
obtain, and, more seaciediy, that the non-crossrefereticing c.s. are not 
sufficiently powerful to define any language of Type 0 or 1. We shall 
now introduce a modification in their definitign, modification which will 
enable us to obtain a complete hierarchy. Following Minsky's [1967, p. 
235] definition for Post systems, we shall cal] cangnic system with auxi- 


liery alphabet a formal system.similar to ordinary 
Definition 22. fs toh ‘ } 


c.s. but in which a subset T of the alphabet V is 


defines is -* 
+ 
TAU ube. 4 
aed . 
rather than aN WG »A ) . The set VNT is called Supiliery alphabet. 


Systems with T= V may be identified with ordinary canenic systems. . 


The difference between canor!: systems with auxiliary alphabet and 
ordinary canonic systems becomes significant only in the case of 
non-crossreferencing canonic systems. For all other canonic systems we 
could define a predicate terminal string and then achieve the desired 


effect by adding a canon Like 


SESE REE ae Se EE RE MRRER ENS, PRR RE A ORS TS GREE RE NBS SpA Se pena RS HS oes 
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x sentential form; x terminal string ke x sentence ‘ 


Example. The set of squares in l-ary. 


6'. ve {i,*} 


a non-crossreferencing 
‘ { 1} one-predicate 
WITH AUZIL. ALPHABET 


Canons; - 1* A 
not sitple 
x*y A b x11 ya A. inserting 

myAbky A 


Examples 6 and 6' show that a set aay be undefinable by Post 
systems (canonic systems with one predicate of dagtie 1 and no auxili- 
ary alphabet) but become definable if we either allow one more predi- 
cate or allow an auxiliary symbol. This "trade-off" between additional 
predicates and additional (auxiliary) symbols is, in fact, an instance 


of a general result; 


Post systems 


more than one predicate (A) one predicate of degree 1 ; 


[(w.l.o.g.) of degree 1] | eel auxiliary symbols 


CANONIC SYSTEMS 


(c) 
(B) 


one predicate of 
higher degree 


(A): Trivial. 


(B)(C) : [Haggerty 1969, p. 44] * 


PR SET PI AES 
* Theorem 3. However, the statement of this theorem, “Any canonic 
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(B) may also be proved by using Theorem H-3 and the proof of Theorem 6b. 
{The number of the predicates will be the number of tapes of the LBA.] 


(C); Proved by introducing separators acting as auxiliary symbols. 


A-result similar to (B)(C) has been announced by N. Kohn [1969]; it in- 
volves variables which range on all but one of the symbols in the alphabet. 


Having defined and exemplified canonic systems with auxiliary alpha- 
bet, we are now ready to derive a hierarchy of non-crossreferencing ca~ 


nonic systems with auxiliary alphabet. 


zheoren_lz- 


a) Non-crossreferencing canonic_ systems with auxiliary alphabet 


are powerful enough to define any r.e._ set. 


ee ed SPO PRO e mm ae =~ oe ap oe en 


b) A_complete hierarchy of such canonic systems may be obtained from 


ae saowouee 


i 


the_second basic hierarchy. 


Proof. Obviously, it is sufficient to prove 'b)' . 7 
The only canon with croasreferencing in the canoniec systems from the 


above-mentioned hierarchy (Chapter II) is 


x dexivable ; x terminal string ‘ x sentence . 


By eliminating it from a given c.s. (together with the canons. which define 


the predicate terminal string ) and by replaging the axioms of the form 


i a terminal 


qualification "...which is a canonical extension [in Minsky's sense] of 
the given canonic system.", since there are formulas which are theorems 
in the Post system without being theorems in the given canonic system, 
and all such formilas contain auxiliary symbols not..in the alphabet of 
the given canonic system. The canonic systems 6 and 6' above are 
examples of systems which can not be simulated untess we allow canonical 
extensions. j no PRESS 


Bo ALOT TE nes BRE resin ema ey gaat Ee canst om cnet Ra eee ES 


ES ce ne St tty SE ES OPE 
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by a declaration 
tT=f{a,... } : 
we obtain a canonic system equivalent’ to the given one. The theorem 


now follows. 


OPEN PROBLEMS: 


l. "TQRS] = 2?" Find the computational power of the class of 
simple, non-crossreferencing, non- insert ing cenonic systems (no auxi- 


liary alphabet, any number of predicates). [ £ors)ptcr) 


2." [Q] =?" Find the computational power of the class of 
non-crossreferencing canonic systems (no euxiliary alphabet, any number 
of predicates). [Includes all finite intersections of context-free 


sets.] 


se [Q,] =?" Find the computational power of [unextended] 
Post systems (non-crossreferencing, no auxiliary alphabet, one predicate 


(necessarily of degree 1) ). 
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